SEQUENCE PROTOCOL




(1) GENERAL INDICATIONS: 

(i) APPLICANT: 

(A) NAME: Deutsches Krebsforschungszentrum

(B) STREET: Im Neuenheimer Feld 280 

(C) TOWN: Heidelberg 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 69120

(ii) TITLE OF THE INVENTION: Modularly Constructed RNA Molecules Having Two Sequence Region Types

(iii) NUMBER OF SEQUENCES: 8 

(iv) COMPUTER-READABLE VERSION: 

(A) DATA CARRIER: floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) DATA OF THE CURRENT APPLICATION: not yet known 

(vi) DATA OF THE PRIOR APPLICATION: 
APPLICATION NUMBER: DE 198 28 624.4 
FILING DATE: June 26, 1998 

(2) INDICATIONS AS TO ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8422 base pairs 

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 

(ii) KIND OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTTAGAGTTT CGTGGCTTCA GGGTGGGAGT AGTTGGAGCA TTGGGGATGT TTTTCTTACC 60 

GACAAGCACA GTCAGGTTGA AGACCTAACC AGGGCCAGAA GTAGCTTTGC ACTTTTCTAA 120 

ACTAGGCTCC TTCAACAAGG CTTGCTGCAG ATACTACTGA CCAGACAAGC TGTTGACCAG 180 

GCACCTCCCC TCCCGCCCAA ACCTTTCCCC CATGTGGTCG TTAGAGACAG AGCGACAGAG 240 

CAGTTGAGAG GACACTCCCG TTTTCGGTGC CATCAGTGCC CCGTCTACAG CTCCCCCAGC 300 

TCCCCCCACC TCCCCCACTC CCAACCACGT TGGGACAGGG AGGTGTGAGG CAGGAGAGAC 360 

AGTTGGATTC TTTAGAGAAG ATGGATATGA CCAGTGGCTA TGGCCTGTGC GATCCCACCC 420 

GTGGTGGCTC AAGTCTGGCC CCACACCAGC CCCAATCCAA AACTGGCAAG GACGCTTCAC 480 

AGGACAGGAA AGTGGCACCT GTCTGCTCCA GCTCTGGCAT GGCTAGGAGG GGGGAGTCCC 540

TTGAACTACT GGGTGTAGAC TGGCCTGAAC CACAGGAGAG GATGGCCCAG GGTGAGGTGG 600 

CATGGTCCAT TCTCAAGGGA CGTCCTCCAA CGGGTGGCGC TAGAGGCCAT GGAGGCAGTA 660
A riTPPTTATP


TARPTOPATA 


TPTTPATPAT 


ATTGGTATAT 


CCTTTTCTGT 


1920 


bill ALAuAb 
TATPTA A ATP
TP.TPPAAPTG


AGAAGTACCT 
bLAAA ivjAbA
CCTCCAGAAC
CTTAGTCAAA
CTACTCGTGA
CATTGGAGGG


*i fi fin 


CCGAAGCATG 


AACAGTGCAC 


CTGGGACAGG 


GAGCAGCCCC 


AAATTGTCAG 
AG AGGG A 1 AL,


v?AV3V^Vjvxn.V3 J. Vw 


^V«Vii IvTVvVTvVJVv 


3900 


GACCATC TGG 


AATTGGTTTA 


GCCCAAG 1GG 








3960 


my mill it n m 

TCTAACCACA 


GCTCCTTTTC 


CAGAGGA 1 1\- 


O 7A P.TP 7A np.pT 1 
GTGGAGCCCA


GGAGTCCCAC 


TCCAAGCCAG 


TV T\ y-T /—V/T/-T TV T» m 

CAAGCCGAA 1 


AGC 1 A 1 Xj l\j 


/ Zo\) 




TTGCCACTTT CCAAGTCACT GCAAAACCAG GTTTTGTTCC GCCCAGTGGA TTCTTGTTTT 7320

GCTTCCCCTC CCCCCGAGAT TATTACCACC ATCCCGTGCT TTTAAGGAAA GGCAAGATTG 7380

ATGTTTCCTT GAGGGGAGCC AGGAGGGGAT GTGTGTGTGC AGAGCTGAAG AGCTGGGGAG 7440

AATGGGGCTG GGCCCACCCA AGCAGGAGGC TGGGACGCTC TGCTGTGGGC ACAGGTCAGG 7500

CTAATGTTGG CAGATGCAGC TCTTCCTGGA CAGGCCAGGT GGTGGGCATT CTCTCTCCAA 7560

GGTGTGCCCC GTGGGCATTA CTGTTTAAGA CACTTCCGTC ACATCCCACC CCATCCTCCA 7620

GGGCTCAACA CTGTGACATC TCTATTCCCC ACCCTCCCCT TCCCAGGGCA ATAAAATGAC 7680

CATGGAGGGG GCTTGCACTC TCTTGGCTGT CACCCGATCG CCAGCAAAAC TTAGATGTGA 7740

GAAAACCCCT TCCCATTCCA TGGCGAAAAC ATCTCCTTAG AAAAGCCATT ACCCTCATTA 7800

GGCATGGTTT TGGGCTCCCA AAACACCTGA CAGCCCCTCC CTCCTCTGAG AGGCGGAGAG 7860


TGCTGACTGT 


AGTGACCATT 


GCATGCCGGG 


TGCAGCATCT 


/-T/— 1 ■» TA TV plpimji 

GG AAG AGC T A 


GGCAGGGIxjI 


Toon 
/ y z u 




CTGCCCCCTC 


CTGAGTTGAA 


GTCATGCTCC 


CCTGTGCCAG 


C C- C» AvjA\jGL-\- 


Vj AbA IjC 1 A 1 AJ 


/you 




/-^ tv t\ tx mmo 

GACAGCATTG 


CCAGTAACAC 


AGGCCACCCT 




/t j\ pprppppfnp 




OUftU 




ACCTGTCTGA 


GGTTGGGAGA 


GGTGCACTTG 








O J. VJ VJ 




CTGGAGATGT CTCTAAAAGC CCTGTATCGT ATTCACCTTC AGTTTTTGTG TTTTGGGACA 8160

ATTACTTTAG AAAATAAGTA GGTCGTTTTA AAAACAAAAA TTATTGATTG CTTTTTTGTA 8220

GTGTTCAGAA AAAAGGTTCT TTGTGTATAG CCAAATGACT GAAAGCACTG ATATATTTAA 8280

AAACAAAAGG CAATTTATTA AGGAAATTTG TACCATTTCA GTAAACCTGT CTGAATGTAC 8340

CTGTATACGT TTCAAAAACA CCCCCCCCCC ACTGAATCCC TGTAACCTAT TTATTATATA 8400

AAGAGTTTGC CTTATAAATT TA 8422



(2) INDICATIONS AS TO ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8464 base pairs

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 

(ii) KIND OF MOLECULE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



pqvn A pi A orrvpm 


LL? 1 VJV5L 1 IV VJ 


vsxjvj 1 VLjLjALs 1 


7A P TTYIP. 7A C1T* 7A 
Avj 1 ITjuauCA 


1 lxjijijAlvjl 1 


X X XV X InLV-u 


fin 


a p a a pp a p a p 


TP APJ^TTP-A A 
XV AVjvj 1 1 VjAA 


rtl7APPT7A.7A.PP7i 


nnnc p 7A p ta a n 


X rivjV-. X X iuLn 


PTTTTPTAAA 


120 


L 1 ALjuL IVL 1 


rpp> a A p> A 7V pjpjp* 
IVAALAALjLtL 


TTPPTPP AP A 


T 7A PT 7A PTP 7A P 
1 AL- 1 Av_ l\j AV- 


PAPAP A APiPT 
LAonL AAVjL ± 


nTTTSAPPAnP. " 

V3 X 1U/1V-V-/1VJU 


1 AO 

X O \J 


LAL 1VLLLLL 


a a p« a am^TVP 
AAUAA 1 A IVL 


nv P* P 1 TV 1 TTPP P* 
IVLL 1 V- 1 lv-L 


r*r , r , r*r , r > r , r t 7a p 




TPTYlPTPfiTT 
l\j 1 LtL ILvj 1 1 




AGGGCAATTG 


a a a /t"» a o a r«ni 
AAAGGAL AL I 


CCCATTTTTG 


UluLCAi l\iA 


rpp!pippimp»mpip» 
l\iLLL xL» ILL 


7A rp 7A 7A T* A PPTT 
A 1 AA 1 ALt/L X 1 


inn 


CCCTGACTTT 


TACACCACCC 


LAAL\IvllaA 


TCTGAAG(jAC 


xLiviVj ALaXj IXj 1 


La A IVTLAL3L3AL 


i £n 


AAACTATGGG 


ACTCTTGGGA 


4~\ tv a a y™»m tv rrv** 

GAAGACTATG 


GAGTTGGCCA 


y~vrrv~« a mm a a /*t/~i 

GTGATTAAGG 


CllAlIaAI 1 


4ZU 


CCAACTGTGG 


TAGCACAGAT 


CTGGCTCCAC 


a m/^ a a r^r\r> a a 

ATCAACCCAA 


m/""* /^v A AAA rtm/T 

IllAAAACTG 


tv tv a i^f* a m a rp 

aCAAGvjATAT. 


A O A 


TTTGCAAAAA 


AAGAAAGTGG 


CACCTGTCTG 


ATC CAGCTCT 


r> a a nv/Tom a 

GACATGGCTA 


GAGGTGAGTl 




CTAAACTGAT 


GGCTTATAAA 


CTAGCCTGAG 


/^l f~\ TV TV /^l TV TV ^"1 TV 

CCACAGAAGA 


GTATGGCCCA 


/-~» tv ^rnr' a a /"»nv* 

GAGTGAAGTG 


can 


TCATCATCTG 


TTCACAAGGC 


ATGCTCCCCT 


TV S*l TV TV ^T TV ITT. TV TV ITI 

AGAAGATAAT 


f*\f~%m TV TV TV TV Z - ^ /^t 

GCTAAAGAGG 


TGllATGGAG 


CCA 


GCAGCAGGAC 


AAAGTACAGG 


CAGGCTAGGT 


GGAGTCAAGC 


/tti y-*i yt yn /^m tv /"^i m 

CAGGCCTAGT 


GClAlAGAAL 


*7 *> n 


AAGAGAGCAG 


TCTGACTAGT 


A A IIW HA A TV /""»/""■« 

AATTAAGAGG 


Tv Tv AAA i~*t~* A 

GAAGAAAGGA 


AAA rp a iT»rp/^rrwTi 
AAA 1 Al 1 Lx 1 


P*P 1 A 7A rprp A purvp 

LLAA1 1AL1 1 


n on 


lLLAb 1 Iv Iv 


L x 1 lAWibAL 


7AP*P ,r P r P7AP , 7A A rp 

ALiL 1 1 ALxAA 1 


T»7V qWT»TY!P A 

1A1 1 loLALi 


A 1 Ivjiiu IV 1 1 


P ATY^TTPPP A 


O *± V/ 


ptrprpp* A A A A P* A 

L 1 ILAAAA.LA 


7A A PAP aTY^PT 
AAL. Avj A 1 ajL 1 


prrvpi A A A pJp» A A 
V— IV AAALaVAA 


7A PTPJ^PTTPP* 7A 
AL 1 Ajvrt— 1 1\jA 


A ATPP-TP AP A 
An. X \ jvj lunV-n 


PTfiTPPPAPA 

V_ X V7 X V^V^V-.AV-/i 


900 


A ppp A PP & P_ A 
ALtL LAL UAuii 


P A TPPP A P.TP 
LA xVLtVAvj IV 


TTP APA APT A 


PPTP.T A TPTfi 


XAX A. X rVV^Vj X V3 


PnPTTGTTTT 

WJ\« X X VJ X X X A 


960 




P A (7T 7A P A T A P. 
L ALaVAL A 1 AVj 


P ATTPPP A AP 


A APrPTPPHA A 


tw*. x v- x nnvj x vj 


TTTGCTGCAA 


1020 


mnvpm»m a A PIP" 


A PTTPPTP AT 
AL 1 xVjAl 


tp^tttptpt 

X\JJA_- X X XV X X 


p»mpipiqy- ,, P , T v PP 


V-X&X X 1V.1 1 VrV. 


X X V-.V^ X 1 Vwtl 


1080 


1 1LA I\jL 111 


pi a ix*rpmp«rprppipi 


PP r PAPPT u T 1 P r P 


X 1X3 X X XV X 


i. Vw X VJ X X\«.V-~rYV7 


PrPAr4CTGCAG 

V7\-nv3\- x vjvnw 


1140 


mp.prrip« a a pp» a. 
1 viC 1\jAALLA 


Pifp(lT"PTAPP 
LA 1 vjL? 1 1 ALL 


rp A A P A PiP» A Pfp 

1 AALAoV-Au 1 


P A PPTYIP APP 
LnoL X W nut 


PPTAPPATTP 


TTPPTGCCCT 

X 1 V. V> X VJV.v.\- 1 


1200 


1 1 AAL 1 lvLL 


A rprppy"ip» a p»rpp» 


PP a PiPT 1 7A TP 7A 
C V_ AVjij 1 A 1X-A 


rp a rpxpiTiA Ap«p«rp 
1A1 1 I nALL I 


TPAPPA APAP. 
X\3AL7L~rLrLvxn.V7 


PTCfifiPTPTT 

\_ XV3V7V3V. 1V.1 i 


1260 


TTGAGC CCTC 


CCTAALL lv T 


r*irnf A AP»7V A O 7V 
iuAAbAAbA 


A P A 7AP7A A pvprp 
ALAAuMuu 1 


AP-PA APPTPT 1 
AVjLaAALjL IV 1 


TP!PTP 4 PTY2p r P 
1 UL XV X X 


X J >£• V/ 


AAGAAAAATG TCAAAAGGCT TTCAGACCTT AAACAATGAG CCTTTTCACC TTTTACTCTA 1380

GAAAAGTGGA CTAGAAAATC TGGGTCACAT TGGGTAGCTG AAGGAGATAC AGAGGCCCCT 1440

ATGGCCTGCC AGAGTCGTTG CATGGCCCAA CAGGGGCTCC ATGCCCACTA CCCTTGACCC 1500

TACTCAGAAA TCTAATGTCA TACTTAGTGT GGGCAGGGGA CCTGTCAGGA CAGATGCAGA 1560

CCTAAGCAGG GAGTGACACC AGGGCCCTTG GCCCTTCTTC TGACAAACAT ACACATCCCA 1620

AGTCTTTTTC TAGTGGAATT CTTAACCTCT TGCTCACTGG GGACTGGGAA GCATCAGCAC 1680

ATCCCATATT TCAAACTCTG CTCCATAAGT ACAGTGGTGA ATTTTATAGA CTTGACTTTG 1740

CTGTGGGGTT TTAATTGGTC AGTTTTAATT TGGGATCCCA AAGTTTTAAC CTCCATTCAG 1800

GAAGTCCTTA TCTAGCTGCA TATCTTCATC ATATTGGTAT ATCCTTTTCT GTGTTTACAG 1860

AGATGTCTCA TATCTATCGA AATCTGTCTG AGAAGTACCT TATCAAAGTA GCAAATGAGA 1920

CAGCAGTCTT ATGCTTCCAG AAACACCCAC AGGCACGTCC CATGTGAGCT GCTGCCATGA 1980 

ACTGTCGAGT GTGTATTGTC TTGTGTATTT TCGTTAACGT TCCCCAGCTT CCTTCCTGCG 2040 

GTGTAATCAT GGAAGAGTGA AACATCATAG AAATCGTCTA GCACTTCCTG GCCAGTCCTT 2100 

AGTGATCAGG AACCGTAGTT GACAGTTCCA ATTGATAGCT TAAGATAAAA CCATGTTTGT 2160 

CTCTTATGGA ATGGTTAGAA CTAAGTGAGA GATCTTGCCC CATTCTGTTT GCCGAATCAT 2220

AGTTGGACTT TTAGTGTATT TGTATCCATT TCCTTGTGCT ATAAAAGCAA ACCCTGCAAC 2280 

CAGCTTTCTG TCAGGCAGTC CTTTTGCCTG CTCTGCTTTT GATCCTCTTA GTCTTGCTTC 2340 

TGGTTCCTCC CTGGAGAGGG AGGAGGGGTC AGAAGAGGAA TTCTGGAGGA TCCAGGATAT 2400 

GTCCTTCTGA ACTCCTGCTT CTTCCAGTGA CAAAAGGCCC CTACTGCCCC ACCCCAACCT 2460 

GCCCCATGCA CTCCTCTAGG ACACCTTTCC ATACTTTTCA CAACACCTAG CCAGGTTGAC 2520

ACCAAGTTGT TTATTGTGGT CTGCTTGGAA TTTTACCTGT TAGGCTTACT TAGTCCAATC 2580

AAATGGACTC CAAGTTGGGT ATCCCTCATC TTTGGAAGAC AACCTAGGCT GATTAGATAT 2640

TTACTTTTGG GATTGCAGCA CTTTGGGTGC CGTTTTTCTT TTACTTGGGT TTTATCTGCA 2700

GCTCCCTCAC CACCACCACC ACCCCCCACT TACCTGTATG TAGAACTGAT TTCAAAACTG 2760

CAGGTGGTGG TAACTGCAGC TTCTTAGGGT TTTCTTCACT TCTTGCTTCT TTCCCCATTC 2820

CCTCATCCAC AAATAAGGGC ATCACAAGTC AGTCTCCTTT AAGCAGGCAG CTTTGGTGGG 2880

GTTTTTCCCC TGGAAGCCAG GGACCCTGTC AGGCTGCCTC TGCCTTGTGG TCAGGTTGAC 2940

AGGAGGTTGG AGGGAAAAGC CTTAAGTCAT GGGATTCTCA CCAGCTGTGT CTGGCTCAGA 3000

CCTGGAATGT GACCTTTATT TTGTTGTATT TGAACATTGT AAAGTGTGGG TGGTACCTTA 3060

AACTGAATAT GTGAAGAATC CAGAAACTGA CCAACAGCTT TCAGATACCT GGGGCTAGGT 3120

CACTAAGGTC ACATCCAGTC TTCCCTACCC TGTTCTAGTT GTTAGCTACT ACCTCTCCCA 3180 

GATAGATTGC TGTATATCCT CCAACTATGA TCATCCTGGC CCAAGCTTGC CTGTTCTTGA 3240 

GTCTGTCTTA ACCAGTGGAA CTGCTGCCCT TGGTGTGCAG TGAGTTGAGG ACTCTTGGTC 3300 

ACAGCCAGGC TCTAGTAGTA CAGCTCCTTT CTGCTGGTGC TGTATTTCCA TATCAAAAGG 3360

CACAGGGGAG ATCTAGAAAT GCCATCTCCC CCAGTCCATC AGTGCCAAAC AAGCCCATGA 3420 

TCCCAGCATG GGTACAGACA ACTCTGTTCA GTGCTATCAC AACAGACTAG AGGCCATGAA 3480 

CATTGGACGT GGGAACCAGA GCAACCCGAA TTGCTGCTGC TTTATTCAGC TTTCCGTTGC 3540 

TCTGACAATG ATAAAACAAG GCAGTAACTT AAAACAGACT GCCAGGTTTG GCAGAGAAAG 3600 

GAAATTCCTT AGCTGACAGC ACCTCTGGAT TTTAAATAGG TTGTAATAAG TGGCTCAAAC 3660 

CCATCCAGGA AAAAGCAAAA GGGTTAGAAC TGACCAGATG AGACCAGCCT GATTTCATGC 3720 

AGCCCAAATG GAGTCCAGCT GTCTGAACTC TGCAGCACTT CTCTACTACA GTCTCCTAGA 3780 

GCATTCCAGC CAGGCTCTTC AGGCTGAGGA GACATCACAG GTGCCAGTTC TTCAAGAAGA 3840 

CTTTTGTGCA TCAGTTCATA GCCTATATCT TTGCCCAAGA TTGTAGATTC AGGTTAACAC 3900 

t a p a p a ttpt


a nnnp a p a tp 


a ptp a p A PTP 


AOAAA AAAAH 


PPPPTY^TY^TIA 


PTP.TPP.TATA 


J J V \J 


PPP A A PT A P A 


A A A APTP Zk A P 


r_pnppT a ppp 


P A CI A TYlPPf^P 
LiiVjn XVtA— v—Vj\— 


A TtTIP PTP 21 TYI 


PPAHAP.PPAA 






PPATPPAPAT 
\— V— 1 UWiV,n 1 


PPTTTTPTPP 
V_V— X X X XV— X Vjy\j 


I— X V-V- X X V— X 1 v_ 


V— X \JV— X \— X V— X Vj 


PTTP AP.TG AA 

V_ X X U/iO X VJAn 


*i V O V 


*w LAo^, V— V— .rtA— 


TPTP A A P A P A 


X x x 0 x xun X X 


PTPTPPTaTT"? 

\— X\— XXX 


X X A X \J X V— XXX 


\— X V— X X X X AW 




X rW— X ±\X r\ 1 JWj 




PTP T A A TTV^T 
Vj X V- X rirt X X\J X 


T 1 A T A A A WTCIH 


TAP A AT APTP. 


V— V— X \— V— V— *— V— flU 


4200 




TUTHTPPTA A 


AGGGGAAAAC 


TTGAACACTG 


A A APP APTTP 
Afxri\—.\— xiVj 1 


TP A A P A A TTT " 
x \jr\J^\^.t\t\ x x x 


4260 


nVjAn\j\j/^Arxrl 


PPTTPAA21 AP 


ATTTAACAAA 


AAATTATATT 


TTZV ATPTTTA 
X XAn.X\7X X Xn. 


TP A AT A APAP 


4320 


VJ7/\ljKjr\- i 1 1 lVJ 


A A A A A ATPTT 


GATCTATAAA 


TACTTACTTT 


AnfiPPTf?Af!f$ 

nUOUv X V3AV7VJ 


TPTPTAATPA 


4380 


\j loAAL l\iAlj 


/1 A A TV^/l/^' A ap 
LAn 1\jOGA/4\- 


TCAAGGCTGA 


AGCCTCCTGC 


ATPAPAPPAP 


P.TAPA APPAP 


AddO 


CjALjCL. IX_ I x\j 


A /"* A mnvnp a pp 
AGA1 1 IXjAGG 


TGTTTTAGCA 


TTGGAAAGCC 


AL 1\- 1 1 ITjVjO 


TanpiY^rjTP 

X AV?l- l\j\ji-\-L. 




C AG aaac tal 


I IX- lvjALt 1 1 


GTCATTTGGA ATGG AGGTT A 


1 vjrtj IX- IXid- 


A a TY2/" , r' & A A 


ftjou 


GCTGCATGAG 


tv t\ /*»OnV**fTVTl 

ACCAGC, IX- 1 I 


GGTTTATCAA 


TTTGAACACT 


PSPTTi A PPTITi 

VJAvj 1 MLL 1 A 


ljAAljrtii»,L.v--ft > o 


Aeon 


CACAAAGTGT 


CTGCTC IX- 1 1 


CTTAACTGAG 


CCTGCCCCAG 


LAL iAL IXiCA 


LAAAi IAvjGVj 


*± O OU 


AGGGTCTACT 


TCCTACAGAG 


CATCCCTCCC 


TGGGCCCCCT 


a nv^onvTtrn 
CCCAIX-V- 1 1 1 


Vj 1 AC- 1X„ 1 Ai-i- 


* / 


TACCTGACCT 


TCAGGATCTT 


GGCACATACG 


AAATGGCTGT 


OrTlAOOA A OO A 

GTAGCAAGK-A 


Lil IXjCjCA IXj 


a q nn 


CCCTCCTAAA 


CTTACCCCAG 


AGCCTCTCCC 


TGCCTCCTTA 


a ppPTiPfnpfpp 
AGCL-AG IX- IXj 


IXj 1X^ 1 1 i- 1 


-HlJOU 


GGGGAGGTGT 


m 7v tv p/tOp a rn 

TAGAGCC-CAT 


AGAATGGAGA 


GGAGAAAGAA 


A A O A A A O A 
AAoAuuAAuA 




*± _7 ^ *J 


rn a /tti 7\ A A A A f" 1 
UAbTAAAAAb 


GC IX- IXjrGGAG 


OA A A /"*• A A OO 

GAAACjACAGC 


CTX-C- 1 AtjrCiC 1 


nwpppxp A App 


APP. APTP APP 


4980 






CATCTTGGAG 


TTTAAGAACA 


TTTf5f3APAAn 


TTGPAAATPA 


5040 


J. 1 lvjC IX- V- 


1 luC X \-V- IV 1 


C AC CTTTTAT 


GGGGCCCTGC 


TTAfiPAPTGA 


AAGPAAATGC 


5100 


PPTPA A A 2l(~K~2 


V— /xf\rlOrt.UrVj X X 


TGGCTCCTGC 


CCACTGATAG 


TPCTTTCCCT 

X V^^' XXX \— V— V— X 


GfCAGTGTTTG 


5160 


1 \j 1 \ j 1 X^tIAvj 1 


ppp a a zipptp 


TTCTTCCTGG 


TGACTCTGAT 


TAGATPPAGT 


AACTTAAGAG 


5220 


All loliiluL 


A 1 1 A»» 1 VJV, 


TTTGACTCTT 


CTATTCTGGG 


\— x x x x \ xxx 


GTTn^TTPAGT 

VJ X X X X X »-VJ X 


5280 


1 1 I VjC 1 1 1 1 A 


bill 1\-V- 1 /i 1 


TTTTATTTTA 


TGCACCAACT 


APrAPAPAPAA 
AVjnv x^\— nn 


APPAPTTGAA 

xXVJV— A*.V*J X X a. 


5340 


1 1 1A1A1A1A 


fTl A <T>A A rri A A 

1 A1A1 Al Al A 


TATATATCTG 


TATATTTCAC 


A ATT AT A A AP 


TPATTTTGPT 

X V--*x X X X X >J>— X 


5400 


±Xj lvjAL. tiC LA 


V-AV-Al-AV-AV-A 


AAAAGAAAAA 


CCTTTTAAAA 


TTATAPPTP/T 


TPP TTA ATT A 

X VJV— x x An x in 


5460 


/"> TV t\ m tv mmm/^m 

UAAIAI I IC-I 


O A T» A A A T> A 


GAGTAGGACA 


AGGGAAAAAA 


TTTA A A AAA A 


AAAAAAAAAA 


5520 


TV TV TV TV TV TV. TV TV /T 

AAGAAAAAAC 


ACA IX- TG IX- 1 


GCTGGTCACT 


TCTTCAATCC 


a appapaTPT 


PTP. ATPTTTP 

\J X VJ/t. X V- X X X V— 


5580 


CTCGCGTCTT TCAAAGACTT CCCTGTGCTA AGTGAAGGAA GCTCCAGGCT GCACCCAGGT 5640

TTTGTGCTTT GTTTCTCCTC TGTTGTGAAA GGGGCCCCAA GATTCTGGGT ACAGGACAGT 5700

TCATTTCAGC ATGGGGTCAG GAGACAAGAG CACTCCCTTT ACATGCTGAC GTACAGAACT 5760

TAGTGGGAAT AGCCTAGTCC CCACCTCTAG GGATGGGGAG CTAGCATGCA TGGGGGTGAC 5820

CCAACTCCCT CCACCTTTCC CTGGCCAGGA AGAGCCTGTG TACAGTAAGT CTGACAAGCT 5880

TTCCCCAGTT AGCAGGGCTC AGAGCATTTA AAAACCCTCC AAACTTTGCT GAGTCTAGGG 5940

ACTAGAGAGA AGATAGAAGA TTTGGTCTAT CTCCAAGGTG TGTAAGCTGT ACCAGGTAGA 6000

ATGCCAGGGA CCCCAGAACC ACATCCAACA GCCCAATGGG TCTCCTCCAG AAAGTAGTGA 6060

AGACTCCAGA AACATCCCTT TCTCTTCTCC CTGCTCCCAT GAGTAACTGC ATTTGCTTTT 6120

GTAATCCTTA ATGAGCATTA TCTGCTAAAA AAAAAAAATT AGCTGTAACA GTTCTTTTTG 6180

CAAAAGGATC ATTCTTAAAT AATTAAAAAC ACCCCCCCCC CAAAAAAAAG TCCAGAACCT 6240

TGTTCTTCCA AAGCAGAGAG CATTATAATC AGGGCCAAAA TCTGTCCCAC ACCTCTACCC 6300

CATCTCCTCA TGATTGCTGC TTCTAAGGCC AGAATACAGC AAAGATATTT GTAGGCCCTT 6360

TGGGTGACTG GGCTACCCTT GGAGCTCTTG GAAGATGGGC TGGGGAAGCC TCTGAGACCC 6420

TATCCTAGGG CCTTGCTCTA GGGAGTAATC AGTATTAGTA GAGTGTCACA ACATTATTCC 6480

CCAGCCGGCA TGAGATGGGG GCAGAAGAAG CCAAAGGGTT GTCTCCACTG CTACTTACTT 6540

GGCCACTGAC AGGTAGGTGA CCATGTATGT CCATATGCAT GTTTTATGGC TGATGTGAGA 6600

TCAGCACCCA AGTTAGCTTC ACCTGGTGAC CTCTAACCCT GCCTGGATGG AGCAGGCCAC 6660

CTGGTTCAAT GTTTCTGGGC AGCTGGACAA TGGAGTGCAA AAGGCTTACA GAACTTGAAG 6720

CCTTTTCCTT ACTTTGCTAG CACGGCCTCC TTTTCCATTT GATTTGTCAC TGCTTCAGTC 6780

AATAACAGCC GCTCCAGAGT CAGTAGTTGA TGAATATATG ACCAAATATC ACCAGGACTG 6840

TTACTCAACG TGTGCCGAGC CCTTTCCTTG TGCTGGGCTC CCTGTGTACC TGGACACTGT 6900

AATGTGTGCT GTGTTTGCTC TCCTTCCTCT TCCTTCCTTG CCCTTTCCTT GTCTTTCTGG 6960

GGTTTTTCTG TTGGGTTTGG TTTGGTTTTA TTTTTCCTTT TGTGTTCCAA ACATGAGGTT 7020

TTCTCTACTG GTCCTCTTTA ACTGTGGTGT TGAGGCTTCT ATTTGTGTAA TTTTTGGTGG 7080

GTGAAAGGAA CTTTGCTAAG TAAATCTCTT CTGTGTTTGA AATGAAGTCT GTATTGTAAC 7140

TATGTTTAAA GTAATTGTTC CAGAGACAAA TGCTTCTAGG TACATTTTCA TTACAAACAA 7200

AGCATTTGAA GGGAGGGAAG TGGTGAATAA GACAAGAGGG GCAATCTGAA TTGATCCCTG 7260

CCCAGATCAG CCAGAAGCTA CCAAAAGTTA AGCACTGGTT TTCCATTCCA AGTCAAGAGA 7320

CTGAAGCTGA TGTTTTGCCA TTTTCAAAGT CAAAGCAAAA CCAGCTTTTC CACCCAATGG 7380

ATTCTTTGCT TCTCCTTCCC AGATTATTAC TACTGCTGTA ATAATCTAGG AGTGCCAGGA 7440

GGGAAAGGAG TATTAACACA GAGCTGTGCT CACTGAGTAT GGAAAGGCTT GGTCTGAGTT 7500

TTCAGGAGGA TGACCCACTG TGGACATGGG GAGAAGACAG AAGATAAATT AGCCGCTCCC 7560

TGCCTAAGAT ACCTCTTAAT AGATAAGTCA AGGCCATGGA CATTATTGTC TACAAGGCAT 7620


GTTTCAAAGA 


CATGACCAGT 


CAGGACACTT 


CTGTC AT AU T 






7680 


CAGTACTAAT CTGATATCTC TGTTCCCGCC ATGCCTGGGG GATAAAATGA TAGCAGAGAC 7740

TCCTTTCCTT CAATGTGATC TAATTCCCAA CAAAATCTGG GCCTGAGATA CCACCTGTTT 7800

CTATGGCAAA CATCCTCAGT AAAGTGTTAT TCTCATTGCA GATTGTTCCA GCCTAATGTA 7860

AGAGGAACAG AGCAGTGTTC CCTTGGAGCC TCATGTGGAC AGTTCTACCT GTAGTGACCA 7920

GTTGGCTATA GTAGTTATTA GCTGGAACAA CCAGACAGGG TACATGCCCC CTCCAAAATC 7980

CATGTTGTAC TCCCCTCTGC CAGCCAGGGG GGGTGAGATC TGTAGAATAG TGCAGCCAGT 8040

GACAAGCCAC CTTGTGTTTG TCACCAGCTC AAAAACTCAT CTAAGGTTGG GAGCAGGCAG 8100 

ACAAGGCAGA GAGAAAGATC CAGGACAGAC CTAGCTGGGC TGGAGGGGTC TTGAAAAGCC 8160 

CTCTGTCGTA TTCACCTTCA GTTTTTGTGC TTTGGGACAA TTACTTTAGA AAATAAGTAG 8220 

GTCGTTTTAA AAACAAAATA TTGATTGCTT TTTTGTAGTG TTCAAAACAA AAGGTTCTTT 8280 

GTGTATAGCC AAATGACTGA AAGCACTGAT ATATTTAAAA ACAAAAGGCA ATTTATTAAG 8340 

GAAATTTGTA CCATTTCAGT AAACCTGTCT GAATGTACCT GTATACGTTT CAAAAACACA 8400 

CCCCACTGAA CCCCTGTAAC CTATTTATTA TATAAAGAGT TTGCCTTATA AATTTACATA 8460 

AAAA 8464 



(2) INDICATIONS AS TO ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 base pairs 

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 

(ii) KIND OF MOLECULE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TTGCTGCAGA TACTACTGAC CAGACAAGCT GTTGACCAGG CACCCCCCCA ATACTCCCCC 60

AATGTGCTCA TTAGAGATAG CAGTTGAGAG GACACTCCCA TTTTTGGTGC CCTGTCCATA 120

GCTTCCCTGA CTCTTCCACC ACCCCAACTC CCAATCTGAG GGACCGGGAG GTGCGAGGCA 180

GGAAAAATAT TGGATTCTTT AGAGAAGACT AGAGGTGACC AGTGACTGTG GCCCAGTAAT 240

TAGAACTGTG GTGGCACAAG TCTGGCCCCA CATCCACCCA ATCCAAAACT GATAAGGATA 300

TTTTGAAAAA CAGGAAAGCA GTACCTGTCT GATCCAGCTC TGGTATAGGT AGGAGTGAGT 360

CCTGAACTGC TGGATTACAG ACTGGCTTGA GCCACAGAAG ATGATGGACC AGAGTAAAGT 420

ATCATCACCT GCTCACAAGG CATGCTTCAC TAGAGAATAA TTCTAAAGAG GTGCCATGGA 480

GGCAGCAGGA CAAGGCACAA GCAGTCTGGG TGGGGGTCAA GCCAGACCTA GTGCCACAGA 540

ACAAGAGAGC AATCTGTGAC TAGTAGTTAG GGACTTTGTG GATGGGACAA GGGGCATGGG 600

GGAAGAAATG AAAATATTCT TCCAATTACT TTCCAGTTCT CCTTTAGGGA CAGCTTAGAA 660

TTATTTGCAC TATTGAGTCT TCATGTTCCC ACTTAAAAAC AAACAGATGC TCTGAAAGCA 720

AACTGGCTTG AAATGGTGAC ACTTTGTCCC ACAAGCCACC AAATGTGGCA GTGTTTAGAA 780

CTACCTGGAT CTGTATATAC CTG 803



(2) INDICATIONS AS TO ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 790 base pairs 

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 



(ii) KIND OF MOLECULE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



TTGCTGCATA TACTACTGAC CAGACAAGCT GTTTATCAGG CTTTTTAGGG TACACCAGCA 60

CCTGCCCTCC ATTCATCCCT GTTGGGAGAG GGATGGTGTA CTGGTTGTCA CTAGAGACCT 120

AACAGAGTAG GGTTAGTGGG AGCTTACATT TTCAGTGCCA TTAACATTCT AGTCCAAGGT 180

CTTAAATTAT TATGTTGAGG GGTTTTTTTT CCCCTGAGGG GGCCGGGGGG TGGGGGGAGG 240

GTTGATTAGA TTCCTTAGGA AAGAGGGTTG AGACAGACAG CAGAGCACTG AGCAGTTGGC 300

ACTAAAGGAG ACCTTGACTA GGGGCCAGGT GGCATCATCT AATCCCAAGG GGCTCCAAGT 360

GAGTATTAGG GTGGGGGAAG ACATTATAGA AGGAATAGAA ACAGGATAGC TCAGCCTAAA 420

GAAGAGCGGT TAAAACCCTA CCCACCAGGA GTTGACTTGA AAGAGGCCCC TATGGAGGAA 480

TCCCCAACCA CCAAAAGCAA TCTTGAGCTG CAGCTGCTTC ATTTAGTGGA CCTTGTGTAT 540

ATCTGGGTGT GTATGCACAT AGATAGACAG TGAGAAAGAA AACTGTTCTT CCAGTTCTTT 600

TCCAGTGCTA CTAGCTTAGG GACAGGTTAG AACTGTCTGC ACAATTGTGT GATCATTCCC 660

ATTCCCACTT CAAAACAAAC TGACTGAGAT GTTCAACAGA AAACTGGCTT CAATGGGTAA 720

CATGCCCTTG CCACTTACTT AAGACACTGG TGTGATGGGG TTTTGAACTC CCTATATTTG 780

TAGGTATCTG 790



(2) INDICATIONS AS TO ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 base pairs 

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 

(ii) KIND OF MOLECULE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TTGCTGCAGA TACTACTGAC CAGACAAGCT GTTGACCAGG CACCTCCCCT CCCGCCCAAA 60 

CCTTTCCCCC ATGTGGTCGT TAGAGACAGA GCAGTTGAGA GGACACTCCC GTTTTCGGTG 120 

CCATCAGTGC CCCGTCTACC ACTCCCCCAG CTCCCCCCAC CTCCCCCACT CCCAACCACG 180 

TTGGGACAGG GAGGTGTGAG GCAGGAGAGA CAGTTGGATT CTTTAGAGAT GGATGTGACC 240 

AGTGGCTATG GCCCGTGCGA TCCCACCCGT GGCGGCTCAA ATCTGGCCCC ACCCCAGCCC 300

CAATCCAAAA CTGGCAAGGA CGCTTCACAG GACAGGAAAG TGGCACCTGT CTGTTCCGGC 360

ATGGCTAGGA GGGAGTTGTC CCTTGAACTA CTGGGTGTAG ACTGGCCTAA ATCACAGGAG 420

AGGATGGCCC AGGGTGAGGT GGCATGGTCC ATTCTCAAGG GACGTCCTCC AGTTGGTGGC 480


ACTAGAGAGG 


CCATGGAGGC 


ACjTAGCjACAA 


GGCACAGCjUA 








GGGCCGAACA CAGCGGGGTG AGAGGGATTC CTCGTCTCAG AGCAGTCTGT GACCGGTAGT 600

TAGGGACTTA GTGGACAGGG AAGGGGCAAA GGGGGAGGAG AAGAAAATGT TCTTCCAGTT 660

ACTTTCCAAT TCTACTCCTT TAGGGACAGC TTAGAATTAT TTGCACTATT GAGTCTTCAT 720

GTTCCCACTT CAAAACAAAC AGATGCTCTG AGAGCAAACT GGCTTGAATT GGTGACGTTT 780

AGTCCCTCAG GCCACCAGAT GTGATGGTGT TGAGAACTAC CTGGATATGT ATATATACCT 840

G 841

(2) INDICATIONS AS TO ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 base pairs

(B) KIND: nucleotide

(C) STRAND FORM: not known

(D) TOPOLOGY: not known

(ii) KIND OF MOLECULE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



TTGCTGCAGA TACTACTGAC CAGACAAGCT GTTGACCAGG CACCTCCCCT CCCGCCCAAA 60

CCTTTCCCCC ATGTGGTCGT TAGAGACAGA GCAGTTGAGA GGACACTCCC GTTTTCGGTG 120

CCATCAGTGC CCCGTCTGCA GCTCCCCCAG CTCCCCCCAC CTCCCCCACT CCCAACCACG 180

TTGGGACAGG GAGGTGTGAG GCAGGAGAGA CAGTTGGATT CTTTCGAGAA GATGGATATG 240

ACCAGTGGCC ATGGCCTGTG CGATCCCACC CGTGGCGGCT CAAGTCTGGC CCCACACCAG 300

CCCCAATCCA AAACTGGCAA GGACGCTTCA CAGGACAGGA AAGTGGCACC TGTCTGCTCC 360

AGCTCTGGCA TGGCTAGGAG GGAGTCGTCC CTTGAACTAC TGGGTGTAGA CTGGCCTGAA 420

CCACAGGAGA GGATGGCCCA GGGTGAGGTG GCATGGTCCA TTCTCAAGGG ACGTCCTCCA 480

ACGGGTGGCG CTAGAAAGGC CATGGAGGCA GTAGGACAAG GCGCAGGCAG GCTGGCCCGG 540

GGTCAGGCCG GGCAGGGCAC AGCGGGGTGA GAGGGATTCC TAATCACTCA GAGCAGTGTG 600

TGACTGGTAG TTAGGGACTC AGTGGACAGG GGAGGGGCGA GGGGGCAGGA GAAGAAAATG 660

TTCTTCCAGT TACTTTCCAA TTCTCCTTTA GGGACAGCTT AGAATTATTT GCACTATTGA 720

GTCTTCATGT TCCCACTTCA AAACAAACGA TGCTCTGAGA GCAAACTGGC TTGAATTGGT 780

GACATTTAGT CCCTCAAGCC ACCAGATGTG AGTGTTGAGA ACTACCTGGA TTTGTATATA 840

TACCTG 846



(2) INDICATIONS AS TO ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 base pairs 

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 

(ii) KIND OF MOLECULE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:






TTGCTGCAGA TACTACTGAC CAGACAAGCT GTTGACCAGG CACTCCCCAC AACAACAACC 60

CCCTCCCTCC TCACCCCACC CCTATCCCCT GTGTGCTCAT TAGAGAGGGC AATTGAGAGG 120

ACACTCCCAT TTTTGGTGCC ACTGATGCCC TGTCCATAGC TTCCCTGACT TTTACACCAC 180

CCCAACTCCC AATCTGAGGG ACTGGGAGGT GTGACGCAGG AGAAACTATA TAGGACTCTT 240

GGGAGAAGAC TATAGAGTTG GCAAGTGATT GCGCCCCAGT AATTCCAACT GTGGTAGCAC 300

AAGTCTGGCT CCACACCAAC CCAATCCAAA ACTGACAAGG ACATTTTGCA AAAAATGAAA 360

GTGGCATTTG TCTGATCCAG CTCTGGCATG GCTAGAGATG AGTCTTAAAC TGTTGGCTTA 420

TAAACTGGCC TGAGCAACAG AAGAGGATGG CCCAGAGTAA AGTGTCATCA TCTGTTCACA 480

AGGCATGCTC CCCTAGAAGT TCATGCTAAA GAAGTGCCAT GGAGGCAGCA GGACAAAGTA 540

CAGGCTAGGT GGAGTCAAGC CAGGCCTAGT GCCACAGAGC AAGAGAGCAG TCTCTGACTA 600

GTAGTTAAGG GGGAAGAAAG AAAAATATTC TTCCAATTGC TTTCCAGTTC TCCTTTAGGG 660

ACAGCTTAGA ATTATTTGCA CTATTGAGTC TTCATGTTCC CACTTCAAAA CAAATAGATG 720

CTCTGAAAGC AAACTGGCTT GAAATGGTGA CACTGTCCCA CAAGCCACCA GACAATGGCA 780

GTGTTCAGAA CTACCTGTAT ATGTATATAC CTG 813



(2) INDICATIONS AS TO ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 base pairs 

(B) KIND: nucleotide 

(C) STRAND FORM: not known 

(D) TOPOLOGY: not known 

(ii) KIND OF MOLECULE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

TTGCTGCAGA TACTACTGAC CAGACAAGCT GTTGACCAGG CACCTCCCCT CCCGCCCAAA 60

CCTTTCCCCC ATGTGGTCGT TAGAGACAGA GCGACAGAGC AGTTGAGAGG ACACTCCCGT 120

TTTCGGTGCC ATCAGTGCCC CGTCTACAGC TCCCCCAGCT CCCCCCACCT CCCCCACTCC 180

CAACCACGTT GGGACAGGGA GGTGTGAGGC AGGAGAGACA GTTGGATTCT TTAGAGAAGA 240

TGGATATGAC CAGTGGCTAT GGCCTGTGTG ATCCCACCCG TGGTGGCTCA AGTCTGGCCC 300

CACACCAGCC CCAATCCAAA ACTGGCAAGG ACGCTTCACA GGACAGGAAA GTGGCACCTG 360

TCTGCTCCAG CTCTGGCATG GCTAGGAGGG GGGAGTCCCT TGAACTACTG GGTGTAGACT 420

GGCCTGAACC ACAGGAGAGG ATGGCCCAGG GTGAGGTGGC GTGGTCCATT CTCAAGGGAC 480

GTCCTCCAAC GGGTGGCGCT AGAGGCCATG GAGGCAGTAG GACAAGGCGC AGGCAGGCTG 540

GCCCGGGGTC AGGCCGGGCA GAGCACAGCG GGGTGAGAGG GATTCCTAAT CACTCAGAGC 600

AGTCTGTGAC TTAGTGGACA GGGGAGGGGG CAAAGGGGGA GGAGAAGAAA ATGTTCTTCC 660

AGTTACTTTC CAATTCTCCT TTAGGGACAG CTTAGAATTA TTTGCACTAT TGAGTCTTCA 720

TGTTCCCACT TCAAAACAAA CAGATGCTCT GAGAGCAAAC TGGCTTGAAT TGGTGACATT 780

TAGTCCCTCA AGCCACCAGA TGTGACAGTG TTGAGAACTA CCTGGATTTG TATATATACC 840

TG 842



